Accelerated Dual Learning by Homotopic Initialization

Authors

  • Hadi Daneshmand
  • Hamed Hassani
  • Thomas Hofmann
Abstract

Gradient descent and coordinate descent are well understood in terms of their asymptotic behavior, but less so in a transient regime often used for approximations in machine learning. We investigate how proper initialization can have a profound effect on finding near-optimal solutions quickly. We show that a certain property of a data set, namely the boundedness of the correlations between eigenfeatures and the response variable, can lead to faster initial progress than expected by commonplace analysis. Convex optimization problems can tacitly benefit from that, but this automatism does not apply to their dual formulation. We analyze this phenomenon and devise provably good initialization strategies for dual optimization as well as heuristics for the non-convex case, relevant for deep learning. We find our predictions and methods to be experimentally well-supported.

Similar resources

Acupuncture at homotopic acupoints exerts dual effects on bladder motility in anesthetized rats

Background: In Chinese medicine, dual effects on target organs are considered a primary characteristic of acupoints. Acupoints may be classified as heterotopic or homotopic in terms of spinal segmental innervation: homotopic acupoints contain afferent innervation in the same segment from which efferent fibers innervate target visceral organs, and heterotopic acupoints utilize different spinal seg...

Raising Distillate Selectivity and Catalyst Life Time in Fischer-Tropsch Synthesis by Using a Novel Dual-Bed Reactor

In a novel dual-bed reactor, Fischer-Tropsch synthesis was studied using two different cobalt catalysts. An alkali-promoted cobalt catalyst was used in the first bed of a fixed-bed reactor, followed by a ruthenium-promoted cobalt catalyst in the second bed. The activity, product selectivity and accelerated deactivation of the system were assessed and compared with a conventional single bed r...

SW-ELM: A summation wavelet extreme learning machine algorithm with a priori parameter initialization

Combining neural networks and wavelet theory as an approximation or prediction model appears to be an effective solution in many application areas. However, when building such systems, one has to face the parsimony problem, i.e., to look for a compromise between the complexity of the learning phase and accuracy performance. Following that, the aim of this paper is to propose a new structure of co...

Accelerated Mini-Batch Stochastic Dual Coordinate Ascent

Stochastic dual coordinate ascent (SDCA) is an effective technique for solving regularized loss minimization problems in machine learning. This paper considers an extension of SDCA under the minibatch setting that is often used in practice. Our main contribution is to introduce an accelerated minibatch version of SDCA and prove a fast convergence rate for this method. We discuss an implementati...

Stable Distribution Alignment Using the Dual of the Adversarial Distance

Learning to align distributions by minimizing an adversarial distance between them has recently achieved impressive results. However, such models are difficult to optimize with gradient descent and they often do not converge without very careful parameter tuning and initialization. We investigate whether turning the adversarial min-max problem into an optimization problem by replacing the maxim...

Journal title:
  • CoRR

Volume abs/1706.03958

Publication date 2017